7 research outputs found
Optimal Coded Diffraction Patterns for Practical Phase Retrieval
Phase retrieval, a long-established challenge for recovering a complex-valued
signal from its Fourier intensity measurements, has attracted significant
interest because of its far-flung applications in optical imaging. To enhance
accuracy, researchers introduce extra constraints to the measuring procedure by
including a random aperture mask in the optical path that randomly modulates
the light projected on the target object and gives the coded diffraction
patterns (CDP). It is known that random masks are non-bandlimited and can lead
to considerable high-frequency components in the Fourier intensity
measurements. These high-frequency components can be beyond the Nyquist
frequency of the optical system and are thus ignored by the phase retrieval
optimization algorithms, resulting in degraded reconstruction performances.
Recently, our team developed a binary green noise masking scheme that can
significantly reduce the high-frequency components in the measurement. However,
the scheme cannot be extended to generate multiple-level aperture masks. This
paper proposes a two-stage optimization algorithm to generate multi-level
random masks named that can also significantly reduce
high-frequency components in the measurements but achieve higher accuracy than
the binary masking scheme. Extensive experiments on a practical optical
platform were conducted. The results demonstrate the superiority and
practicality of the proposed over the existing masking
schemes for CDP phase retrieval
Regularity scalable image coding based on wavelet singularity detection
In this paper, we propose an adaptive algorithm for scalable wavelet image coding, which is based on the general feature, the regularity, of images. In pattern recognition or computer vision, regularity of images is estimated from the oriented wavelet coefficients and quantified by the Lipschitz exponents. To estimate the Lipschitz exponents, evaluating the interscale evolution of the wavelet transform modulus sum (WTMS) over the directional cone of influence was proven to be a better approach than tracing the wavelet transform modulus maxima (WTMM). This is because the irregular sampling nature of the WTMM complicates the reconstruction process. Moreover, examples were found to show that the WTMM representation cannot uniquely characterize a signal. It implies that the reconstruction of signal from its WTMM may not be consistently stable. Furthermore, the WTMM approach requires much more computational effort. Therefore, we use the WTMS approach to estimate the regularity of images from the separable wavelet transformed coefficients. Since we do not concern about the localization issue, we allow the decimation to occur when we evaluate the interscale evolution. After the regularity is estimated, this information is utilized in our proposed adaptive regularity scalable wavelet image coding algorithm. This algorithm can be simply embedded into any wavelet image coders, so it is compatible with the existing scalable coding techniques, such as the resolution scalable and signal-to-noise ratio (SNR) scalable coding techniques, without changing the bitstream format, but provides more scalable levels with higher peak signal-to-noise ratios (PSNRs) and lower bit rates. In comparison to the other feature-based wavelet scalable coding algorithms, the proposed algorithm outperforms them in terms of visual perception, computational complexity and coding efficienc
A total-internal-reflection-based FabryâPĂ©rot resonator for ultra-sensitive wideband ultrasound and photoacoustic applications
In photoacoustic and ultrasound imaging, optical transducers offer a unique potential to provide higher responsivity, wider bandwidths, and greatly reduced electrical and acoustic impedance mismatch when compared with piezoelectric transducers. In this paper, we propose a total-internal-reflection-based FabryâPĂ©rot resonator composed of a 12-nm-thick gold layer and a dielectric resonant cavity. The resonator uses the same Kretschmann configuration as surface plasmon resonators (SPR). The resonators were analyzed both theoretically and experimentally. The experimental results were compared with those for an SPR for benchmarking. The 1.9-ÎŒm-thick-PMMA- and 3.4-ÎŒm-thick-PDMS-based resonators demonstrated responsivities of 3.6- and 30-fold improvements compared with the SPR, respectively. The measured bandwidths for the PMMA, PDMS devices are 110Â MHz and 75Â MHz, respectively. Single-shot sensitivity of 160Â Pa is obtained for the PDMS device. The results indicate that, with the proposed resonator in imaging applications, sensitivity and the signal-to-noise ratio can be improved significantly without compromising the bandwidth
PCB Soldering Defect Inspection Using Multitask Learning under Low Data Regimes
To increase the reliability of the printed circuit board (PCB) manufacturing process, automated optical inspection is often employed for soldering defect detection. However, traditional approaches built on handcrafted features, predefined rules, or thresholds are often susceptible to the variation of the acquired imagesâ quality and give unstable performances. To solve this problem, a deep learningâbased soldering defect detection method is developed in this article. Like many realâlife deep learning applications, the number of available training samples is often limited. This creates a challenging lowâdata scenario, as deep learning typically requires massive data to perform well. To address this issue, a multitask learning model is proposed, namely, PCBMTL, that can simultaneously learn the classification and segmentation tasks under lowâdata regimes. By acquiring the segmentation knowledge, classification performance is substantially improved with few samples. To facilitate the study, a soldering defect image dataset, namely, PCBSPDefect, is built. It focuses on the dual inâline packages (DIP) at the PCB back side, DIP at the PCB front side, and flat flexible cables. Experimental results show that the proposed PCBMTL outperforms the best existing approaches by over 5â17% of average accuracy for different datasets
DeepGIN: Deep Generative Inpainting Network for Extreme Image Inpainting
International audienceThe degree of difficulty in image inpainting depends on the types and sizes of the missing parts. Existing image inpainting approaches usually encounter difficulties in completing the missing parts in the wild with pleasing visual and contextual results as they are trained for either dealing with one specific type of missing patterns (mask) or unilaterally assuming the shapes and/or sizes of the masked areas. We propose a deep generative inpainting network, named DeepGIN, to handle various types of masked images. We design a Spatial Pyramid Dilation (SPD) ResNet block to enable the use of distant features for reconstruction. We also employ Multi-Scale Self-Attention (MSSA) mechanism and Back Projection (BP) technique to enhance our inpainting results. Our Deep-GIN outperforms the state-of-the-art approaches generally, including two publicly available datasets (FFHQ and Oxford Buildings), both quantitatively and qualitatively. We also demonstrate that our model is capable of completing masked images in the wild